621 research outputs found
ON MONITORING LANGUAGE CHANGE WITH THE SUPPORT OF CORPUS PROCESSING
One of the fundamental characteristics of language is that it can change over time. One
method to monitor the change is by observing its corpora: a structured language
documentation. Recent development in technology, especially in the field of Natural
Language Processing allows robust linguistic processing, which support the description of
diverse historical changes of the corpora. The interference of human linguist is inevitable as
it determines the gold standard, but computer assistance provides considerable support by
incorporating computational approach in exploring the corpora, especially historical
corpora. This paper proposes a model for corpus development, where corpus are annotated
to support further computational operations such as lexicogrammatical pattern matching,
automatic retrieval and extraction. The corpus processing operations are performed by local
grammar based corpus processing software on a contemporary Indonesian corpus. This
paper concludes that data collection and data processing in a corpus are equally crucial
importance to monitor language change, and none can be set aside
PROPER WORDS TO COMMON WORDS CONVERSION: THE FAMOUS, THE INFAMOUS AND THE GROWTH OF INFORMAL LEXICON
This paper seeks to describe the conversion (one of word formation processes) from proper words to
common words in Indonesian. By the time this paper was written, JONRU ‘to lie’ and TEDJO ‘to be
reckless’ were quote infamous: the frequency of occurrence was quite productive especially in social media.
In terms of the morphological forms, the word formation follows the affix formation in Indonesian. The
semantic features of these words refer to the negative prosodies that these words carry, which seem to root
from how people assess their quality. I argue that the conversion is aimed at performing evaluation on
personal quality regardless of its morphological forms. The trigger of this phenomenon seems to be the
events where mass media mostly pay attention to; and this is commonly negative in meaning. Consider also
di-munirkan ‘to be assasinated’ and di-gusdurkan ‘to be impeached’ where MUNIR (a human right activist)
was assassinated and GUSDUR (4
th
Indonesian President) was impeached. There are some other cases
when they do not specifically relate to the evaluation but using the person’s fame to make it widely used
such as SUDOMO, which is an acronym of Susu Doi Montok (sexy breasts). However, it is important to
realize that the objects of the research in this paper 1) are not used in formal situation 2) are productive in
a short span of time and 3) are understood in condition that the knowledge is shared among speakers
ON THE CHOICE ADDRESS FORMS: INTIMATE ADDRESS FORMS AS IN-GROUP IDENTITY MARKERS OF BLACK SOUTH AFRICANS IN 'Invictus' MOVIE
Invictus is a movie which is adapted from a true story of how the South African President, Nelson Mandela, tried to unite South Africa by supporting the national rugby team, Springbok, which used to be the symbol of Apartheid. His relation with other characters in this movie is reflected from the address forms and the choice is influenced by many aspects like social distance among the participants, age difference, formality scale etc. This paper focuses on the choice of address forms used among the participants. Besides the variety and progressive use of address forms, the finding also shows that the choice of address forms used by 'black' and 'white' South Africans are quite different, especially those addressed to Mandela. I observed that some in-group identity markers among the blacks are realized by intimate address forms like Mandela's nickname Madiba, or companion address Comrade. In the past, these address forms are known as the devices to struggle against the Apartheid system
ANNOTATION MODEL FOR LOANWORDS IN INDONESIAN CORPUS: A LOCAL GRAMMAR FRAMEWORK
There is a considerable number for loanwords in Indonesian language as it has been,
or even continuously, in contact with other languages. The contact takes place via different
media; one of them is via machine readable medium. As the information in different languages
can be obtained by a mouse click these days, the contact becomes more and more intense. This
paper aims at proposing an annotation model and lexical resource for loanwords in
Indonesian. The lexical resource is applied to a corpus by a corpus processing software called
UNITEX. This software works under local grammar framewor
TRANSDUCER FOR AUTO-CONVERT OF ARCHAIC TO PRESENT DAY ENGLISH FOR MACHINE READABLE TEXT: A SUPPORT FOR COMPUTER ASSISTED LANGUAGE LEARNING
There exist some English literary works where some archaic words are still used; they are relatively distinct from Present Day English (PDE). We might observe some archaic words that have undergone regular changing patterns: for instances, archaic modal verbs like mightst, darest, wouldst. The –st ending historically disappears, resulting on might, dare and would. (wouldst > would). However, some archaic words undergo distinct processes, resulting on unpredictable pattern; The occurrence frequency for archaic english pronouns like thee ‘you’, thy ‘your’, thyself ‘yourself’ are quite high. Students that are Non-Native speakers of English might come across many difficulties when they encounter English texts which include these kinds of archaic words. How might computer be a help for the student? This paper aims on providing some supports from the perspective of Computer Assisted Language Learning (CALL). It proposes some designs of lexicon transducers by using Local Grammar Graphs (LGG) for auto-convert of the archaic words to PDE in a literature machine readable text. The transducer is applied to a machine readable text that is taken from Sir Walter Scott’s Ivanhoe. The archaic words in the corpus can be converted automatically to PDE. The transducer also allows the presentation of the two forms (Arhaic and PDE), the PDE lexicons-only, or the original (Archaic Lexicons) form-only. This will help students in understanding English literature works better. All the linguistic resources here are machine readable, ready to use, maintainable and open for further development. The method might be adopted for lexicon tranducer for another language too
LINKING METAPHORS AND ARGUMENTS TO SEMANTIC PROSODIES: A CASE STUDY OF VICTORY VERBS IN INDONESIAN ONLINE FOOTBALL NEWS
The use of hyperbolic victory verbs such as menghancurkan ‘to destroy’, menekuk ‘to fold’, menggunduli ‘to shave bald’ characterizes football news report in Indonesia. These verbs are used in the specific domain; therefore, suggesting that they need further examination. The objectives of this research are 1) to map metaphor classes and the arguments of these verbs and 2) to confirm whether the metaphor classes and the arguments are determinant to the semantic prosody of these verbs. Texts under football domainthat contain victory verbs were collected from different online news portals. The examination of victory verbs resulted on 10 affix formations and 10 different metaphor classes. Of these victory verbs, the frequent semantic roles are (the victors), (the defeated teams) and (the victories). The identification of the semantic prosody has shown that affix formation is fairly distributed and not significantly correlate to prosody. However, there is a strong tendency that metaphor class with negative nuance (like +DESTRUCTION, +WAR, +FIGHT) and the presence of an argument that takes semantic role suggests negative semantic prosody. They might be major cues to prosody in this data, but reexamination on a terminal level is still required to formalize this description, as some exceptions and irregularities are also presen
ON THE CHOICE ADDRESS FORMS: INTIMATE ADDRESS FORMS AS IN-GROUP IDENTITY MARKERS OF BLACK SOUTH AFRICANS IN ‘Invictus’ MOVIE
oai:teflin.ejournal.web.id:article/10Invictus is a movie which is adapted from a true story of how the South African President, Nelson Mandela, tried to unite South Africa by supporting the national rugby team, Springbok, which used to be the symbol of Apartheid. His relation with other characters in this movie is reflected from the address forms and the choice is influenced by many aspects like social distance among the participants, age difference, formality scale etc. This paper focuses on the choice of address forms used among the participants. Besides the variety and progressive use of address forms, the finding also shows that the choice of address forms used by ‘black’ and ‘white’ South Africans are quite different, especially those addressed to Mandela. I observed that some in-group identity markers among the blacks are realized by intimate address forms like Mandela’s nickname Madiba, or companion address Comrade. In the past, these address forms are known as the devices to struggle against the Apartheid system
IN SEARCH OF PRONOUN EQUIVALENCE: CHALLENGES IN TEACHING INTERPRETING
There are some challenges in Interpreting; in this study, we focus more on the interpreting of pronouns from English as Source Language (SL) to Indonesian as Target Language (TL). The data was collected within simulations in classroom setting. The results showed many unnatural translation of pronoun ‘it’ and ‘this’. References of these pronouns are anaphoric and may take different linguistic units (clause, phrase, word). There were some strategies employed to achieve natural interpreting in TL. Some students repeated the reference(s); but they need to recover the whole references (might be very challenging when the reference(s) are complex). Other strategies are partial recovery, or lexical addition that complies with TL structure. There are some suggestions for an interpreter to deal with the challenges; one of them is to be familiar with the topic of what they are interpreting. In this way, they do not have to rely on short-term memory skills to decide which strategy to employ
Current Implementation and Future Prospects of Santi-Morf V.1.0
SANTI-Morf (Prihantoro, 2021) is a new morphological analyser for Indonesian. In SANTI-Morf annotation scheme (Prihantoro, 2019), morpheme tokens are linked to their annotations. The tokens are presented in their orthographic and citation forms to allow (allo)morph or morpheme-based searches. Users can also perform retrievals on the basis of formal and functional morphological criteria as SANTI-Morf tagset encodes the analyses of morphemes’ forms (e.g. roots, clitics, affix type) and functions (e.g. passive voice, active voice, adjective degrees, etc.). Currently, the scheme is implemented in Nooj (Silberztein, 2003), a linguistic development environment. It enables users to index and annotate Indonesian texts in their local PC, and later perform searches based on morphological criteria and or tokens defined by the SANTI-Morf scheme. AbstrakSANTI-Morf (Prihantoro, 2021) adalah sebuah program analisis morfologi terbaru untuk bahasa Indonesia. Dalam skema anotasi SANTI-morf (Prihantoro, A new tagset for morphological analysis of Indonesian, 2019), setiap token morfem terhubung dengan anotasinya. Token-token ini direpresentasikan dalam bentuk ortografis dan bentuk sitasi sehingga memungkinkan pengguna untuk melakukan penelusuran berbasis (alo)morf atau morfem. Selain itu, pengguna juga bisa melakukan penelusuran berbasiskan bentuk atau fungsi morfem. Ini karena tagset analitik yang digunakan di SANTI-morf mencakup bentuk (di antaranya: akar, klitik, jenis afiksasi) dan fungsi (di antaranya: aktif, pasif, derajat ajektiva). Saat ini, SANTI-morf diimplementasikan menggunakan NooJ (Silberztein, 2003), sebuah program pengembangan aplikasi linguistik. Pengguna dapat mengindeks dan menganotasi teks berbahasa Indonesia di komputer mereka, dan selanjutnya melakukan penelusuran menggunakan kriteria morfologi dan skema tokenisasi yang digunakan di skema anotasi SANTI-morf
RUANG NORMAL DALAM TOPOLOGI
Dalam tulisan ini
later belakang pembicaraan
adalah ruang topologi . Pada ruang topologi akan di
normal
ruang normal ini akan dikaitkan dengan
misalnya : regular ,pa sebagainya .
definisi-definisi yang
berkaitan dengan ruang topologi , ruang normal. , regu
lar dan sebagainya juga berikan,contoh-contoh.
9
right owner.(s) agree•that UNDIP-IR may, withOl urpose of preservation. The author(s) or copyright purpose of security, back-up and Preservation:- •
This document- is Undip Institutional Repository Collection. .1 The .o i• ,op
changing the content, translate the submission to:any medium or format for the owner(s) also agree that UNDIP-IR may keep more than one copy of this '5ubmission fOr
http://eprintiL:rdtp.ac.id
- …